Picture for Vasu Sharma

Vasu Sharma

Findings of the Counter Turing Test: AI-Generated Image Detection

Add code
May 21, 2026
Viaarxiv icon

Mechanistic origins of catastrophic forgetting: why RL preserves circuits better than SFT?

Add code
May 21, 2026
Viaarxiv icon

Playing Devil's Advocate: Off-the-Shelf Persona Vectors Rival Targeted Steering for Sycophancy

Add code
May 20, 2026
Viaarxiv icon

ProMoral-Bench: Evaluating Prompting Strategies for Moral Reasoning and Safety in LLMs

Add code
Feb 05, 2026
Viaarxiv icon

The Llama 4 Herd: Architecture, Training, Evaluation, and Deployment Notes

Add code
Jan 15, 2026
Viaarxiv icon

Reasoning Relay: Evaluating Stability and Interchangeability of Large Language Models in Mathematical Reasoning

Add code
Dec 16, 2025
Viaarxiv icon

WOLF: Werewolf-based Observations for LLM Deception and Falsehoods

Add code
Dec 09, 2025
Viaarxiv icon

SALT: Steering Activations towards Leakage-free Thinking in Chain of Thought

Add code
Nov 11, 2025
Viaarxiv icon

A Comprehensive Dataset for Human vs. AI Generated Text Detection

Add code
Oct 26, 2025
Figure 1 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Figure 2 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Figure 3 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Figure 4 for A Comprehensive Dataset for Human vs. AI Generated Text Detection
Viaarxiv icon

ERGO: Entropy-guided Resetting for Generation Optimization in Multi-turn Language Models

Add code
Oct 15, 2025
Viaarxiv icon